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Searching and fixating: scale-invariance vs. characteristic 
timescales in attentional processes 
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PACS 87 . 19 . It - Sensory systems 

Abstract. - In an experiment involving semantic search, the visual movements of sample popu- 
lations subjected to visual and aural input were tracked in a taskless paradigm. The probability 
distributions of saccades and fixations were obtained and analyzed. Scale-invariance was observed 
in the saccadic distributions, while the fixation distributions revealed the presence of a character- 
istic (attentional) time scale for literate subjects. A detailed analysis of our results suggests that 
saccadic eye motions are an example of Levy, rather than Brownian, dynamics. 
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While there has been extensive quantitative analysis of 
complexity in the natural world [1,2 , the quantitative 
analysis of complex cognitive systems is still in its infancy. 
' We know that our eyes respond to verbal influences by 
looking at an object of speech, or by imagining its visual 
analogues. Psychologists have carried out experiments on 
eye movements during reading [3HS] and scene perception 
[6l[7] and suggested that visual mechanisms can be broadly 
divided into saccades and fixations. During a saccade, the 
visual system is involved in search, and cognitive process- 
ing is minimal, while when it is fixated on an object, search 
is temporarily at an end, and cognitive processing is max- 
, imal. Recently, there has been some work on saccadic 
' processes in isolation (i.e. when fixations are not taken 
into account); however, their conclusions are often diver- 
gent [8,9. In fact, saccades and fixations correspond to 
changing states of attention in a dynamical system, so that 
a realistic quantitative approach would examine each one, 
as well as how each one affects the other; to the best of 
our knowledge, this has not been done prior to the present 
study. Additionally, subjects who are given an overt task 
to perform may respond differently from those who are 
tracking images through unconscious processing; the sec- 
ond process might yield insights that are more ' natural ' 
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and it is for this reason that the design of the experiment 
analyzed below involves a taskless paradigm. Last but not 
least, we examine the effect of literacy on our observations, 
since literacy may have a direct influence on attentional 
mechanisms affecting eye movement [TO] . 

The link between attention and overt eye movements 
has long been debated [11]; it is now generally accepted 
that attention is not focused on any object during sac- 
cades, as the system searches for its next fixation. For our 
purposes, since we are interested in quantifying frequen- 
cies of saccades and fixations during a language-mediated 
visual search, we assume that fixations indicate zones of 
attentional stability, while saccades involve random, rela- 
tively inattentive, search. Another issue involves the se- 
lection of a comprehensive sample, with varying literacy 
levels; for this, one needs to construct a task which re- 
lies as little as possible on formal education. The visual 
world eye-tracking paradigm [12] was used in an experi- 
ment whereby overt visual attention to different objects 
was measured when participants looked at a display and 
processed an auditory sentence about it. Participants are 
under no instructional control, and are fully unaware that 
they are performing a set task. Saccades and fixations dur- 
ing this natural search are influenced by both the emerg- 
ing visual and linguistic representations, and their inter- 
actions [13][T3]. We underline that no previous study has 
measured eye movements in illiterate subjects, with a view 
to examining their attentional responses. 
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Fig. 1: Example of a trial sequence: the sample stimulus shows 
the phonological and semantic competitors of a target word. In 
this case the auditory sentence was : Usne aaj taara dekha, (He 
saw a star today), where the phonological analogue to taara 
(star) was taala (lock), while the semantic analogue was the 
image of the sun. The other two images are distractors 



The display included four black and white line drawings 
of common objects. One of these objects was semantically 
related to the target spoken word in the auditory sentence 
while another was related phonologically, e.g. the words 
may have shared the first syllable. The other two objects 
were distractors. The participants included 36 illiterate 
and 32 literate subjects, selected from the Hindi-speaking 
population of Allahabad [15]. The spoken sentences were 
neutral, and the target word came after about 16 seconds 
on average from the sentence onset. Fig. [T] illustrates a 
sample display. The fixation and saccadic eye movement 
data were collected and recorded with an SMI High Speed 
eye tracker running at a sampling rate of 1250 Hz; this 
recovered the X and Y coordinates of a gaze with an ac- 
curacy of 0.01 degree, and the resulting data were binned 
using MATLAB. This was done for 35 trials per subject. 

Saccades were identified following a velocity-based al- 
gorithm when the movement of the eye was greater than 
30°/sec in any direction from its current position. Fixa- 
tions were identified with the stationarity of eye motion in 
any location, for a minimum duration of 80 msec occur- 
ring between two saccades. For each of the subjects, the 
data were manually scanned, and probability distributions 
were computed to show the frequencies of fixation/saccade 
durations. Overall, saccades appear to follow power-law 
distributions, P(t s ) = (t s ) - ^, where t s is saccadic time; 
averaging over all the subjects, we obtain the exponent 
f3 = —1.77 ±0.23, for both literate and illiterate subjects, 
with only about four of the illiterate subjects and one of 
the literate subjects showing significant deviations from 
such behavior. We observed however, that illiterate sub- 
jects tended to spend relatively larger amounts of time on 
saccadic movements, even when power-law behavior was 
apparent overall. In Fig. [2j we show the distributions of 
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Fig. 2: Upper: The probability distribution P(tf) of fixation 
time tf for a typical literate subject. The peak of the distribu- 
tion is at 179.12 ± 16.14 msec. Lower: The log of the probabil- 
ity distribution P(t s ) of saccadic time t s vs In (t s ) for a typical 
literate, showing a power law with exponent /3 = —1.67 ±0.14. 

fixations and saccades for a typical literate subject. 

The major difference between the two types appears 
in the distribution of fixations, which are marked by the 
presence of a characteristic time scale for the literate sub- 
jects (see Table [1]). A skew one-peaked distribution with 
a peak that occurs in the range 190-260 msec (see Table 
[1]), with long and occasionally complex tails, typifies the 
fixation distribution for literates; this suggests the pres- 
ence of a typical attention-related peak when the subjects 
find meaningful correlations between a target image and 
the spoken word. For the illiterates, the distributions are 
less uniform. While some of them are rather similar to the 
skew 1-peaked distribution found in the literate case, most 
of them seem to exhibit power-laws, with no characteristic 
timescales, so that no fixation peaks are manifested (see 
Table [1]): significantly, in the latter case, there are high 
occurrences of large saccadic times. This seems to suggest 
that illiterate subjects show poorer cognitive processing, 
searching extensively and rarely fixating on their targets. 
The distributions for such an extreme case are shown in 
Fig. [3l The position of the fixation (attentional) peak is 
absent, and no convincing line can be fitted to the data 
for the saccades either. 
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Table 1: Fixation time (in msec) distributions of 32 literates 
and 36 illiterates. Note that all 32 literates had fixation peaks, 
while only 5 out of 36 literates manifested fixation peaks. 



Literacy level 


Serial No. 


Peak value 


Literates 


6,9,21,22,25,29,30 


194.88 ± 12.92 


10,11,19,26,28,31 


221.36 ±7.12 


1,3,4,7,8,14,24 


234.29 ±3.05 


2,15,16,18,20,23,32 


243.84 ±2.62 


5,12,13,17,27 


259.67 ±9.66 


Illiterates 


5,4,10,14,23 


247 ±23.57 


Others 


did not fixate 
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In the remainder of this Letter, we focus on saccadic 
eye movement, which has attracted an immense amount 
of attention recently [14,16-18 . There is an ongoing con- 
troversy on whether saccadic dynamics can be character- 
ized as diffusive [8] or hyperdiffusive [9]. Traditional ap- 
proaches to cognitive science had modeled language com- 
prehension and visual search as ordinary diffusion [T91I20] ; 
cognition was there viewed as a short-memory diffusion 
process that propagated as a linear function of time or 
stimulus-set size [2TJ[22]. More recent results have found 
that strong temporal correlations exist in cognitive behav- 
ior [23], suggesting that hyper diffusion prevails. In their 
turn, hyperdiffusive processes may be classed as Gaussian 
or Levy, depending on whether the central limit theorem 
holds [25 ,26 ; this corresponds to whether or not it is tem- 
porally uncorrelated [27j[28]. Typically, a process x(t) is 
characterised by two exponents: the diffusion exponent J, 
obtained from x(t) ~ t 6 , and the Hurst exponent H, ob- 
tained from V(t) =< \x(t) -x(0)\ 2 >~ t 2H , where V(t) is 
the mean-squared displacement of the process x(t). When 
5 = H , the process is Gaussian; else, it is Levy [24U26] . 

In the following, we analyze the spatial positions of 
the gazes of 8 representative subjects, spanning literates 
and illiterates, with an aim to determining the nature of 
saccadic dynamics; our results are presented in Figs. |4] 
and [5j and summarised in Table El For each of these 
subjects, a time series x(t) is constructed from the Eu- 
clidean distances corresponding to the gaze coordinates 
at times t = 1,2, ...N during a given saccade. Subtra- 
jectories Zi(n) starting in the i th time bin and spanning 
a time duration n are then computed using the relation 
~ Xlj=i[ x (^ ± J ' ~ 1)] [9,29-31; as the name im- 
plies, the subtrajectories Zi(n) are thus the cumulative 
Euclidean distances spanned in time windows of length 
+ n — 1] during saccades. These subtrajectories form 
the basis of estimating the exponents S and H in what 
follows. 



The standard deviation analysis method of [29l|3T] pro- 
vides the basis of the calculation of the Hurst exponent H. 
Defining the standard deviation D(n) for subtrajectories 
of length n : 




Fig. 3: Upper: The probability distribution P(tf) of fixation 
time tf for an extreme illiterate. No characteristic peak is dis- 
cernible. Lower: The log of the probability distribution P(t s ) 
of saccadic time t s vs In (t s ) for an extreme illiterate, where no 
power law is discernible. No lines have been fitted to the data 
for this reason. 
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the Hurst exponent H is obtained from D(n) ex n 
[29U3T] . The exponent S is estimated using the diffusion 
entropy analysis (DEA) method of [29,30 ; this involves 
constructing the histograms of the end points of the sub- 
trajectories Zi(n). If Pi(n) is the probability that an end 
point of a subtrajectory Z(n) of duration n lies in the i th 
time bin, the Shannon entropy for a time duration n is 
expressible as: 



N- 



(2) 



The DEA is based on the assumption of stationarity for 
the probability distribution function (pdf) characterising 
any time series as a diffusion process [30 ,31]. If stationar- 
ity holds, the pdf for our time series x(t) can be written 
as p(x,t) = jfF(-jj). On going to the continuum limit 
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Table 2: Various exponents plotted for the sample subjects 
whose data are plotted in Figs. [Hand [5] 



ln(n) 



Literacy Level 


Serial No. 


H 


S 


a 


li 


Literates 


10 


0.92 


0.87 


1.46 


1.85 


21 


0.96 


0.90 


1.79 


2.75 


25 


0.97 


0.92 


1.94 


1.96 


31 


0.98 


0.91 


1.87 


2.35 


Illiterates 


11 


0.97 


0.90 


1.71 


2.81 


14 


0.98 


0.89 


1.78 


2.17 


24 


0.87 


0.81 


1.13 


2.14 


32 


0.98 


0.92 


1.87 


1.35 




ln(n) 



Fig. 4: Upper: Plot of In D(ri) vs. In n for subtrajectories Z(n) 
corresponding to 8 representative subjects during saccadic eye 
movement. The legend indicates the correspondence between 
the symbol and the type/serial number of the subject. The 
exponents H are found from the slope (see text) and tabulated 
in Table [21 Lower: Plot of Shannon entropies S(n) vs. In n 
for saccades corresponding to 8 representative subjects. The 
legend indicates the correspondence between the symbol and 
the type/serial number of the subject. The exponents 5 are 
found from the slope (see text) and tabulated in Table [21 



of the Shannon entropy 



/oo 
p(x, t)ln(p(x, t)) dx, 
-oo 



one gets S(t) = A + 5ln(t), which yields for discrete time 
durations n: 



S(n) = A + Sln(n), 



(3) 



where A is some constant. Eqs. [2] and [3] were used to 
plot Fig. [4j the slopes of the lines in the upper and lower 
plots respectively yielded the values of the Hurst exponent 
H and S for eight representative subjects, which were then 
tabulated in Table O 

The values of H (H > 0.5 indicates hyperdiffusion) 
obtained and the fact that S ^ H across Table [2] both 
suggest that our data are consistent with Levy processes. 
Table [2] also contains values of the exponent \i obtained 
from P(x) = for each subject; the /i values so ob- 
tained independently confirm Levy- like behavior [291 130]. 



Finally, another confirmation of hyperdiffusion comes from 
our measurement of the power spectrum P(f) ~ l// a , 
plotted against frequency / for all 8 subjects (Fig. [5]). The 
resulting values of the exponent a (see Table [2]) all lie 
between 1 and 2, and are consistent with the nontrivial 
temporal correlations typical of hyperdiffusive dynamics 

Having shown the internal consistency of our results, we 
now report external support. The findings of Van Loon et 
al. [33] who found non-Gaussian behavior for the timing 
of sequence for the ' later ' saccades in a multiple-fixation 
search like our own, buttress our conclusions. They are in 
contrast to the findings of [8 , where fractional Brownian 
behavior was observed in the predictive eye movements of 
saccades; this is probably because, in our case, the experi- 
mental paradigm is not predictive at all, and participants 
are not aware that they are performing a task. Finally, 
our results support the findings of [3l] , where it was shown 
that fractional Levy-like dynamics (rather than ordinary 
Levy walks with H = 0.5) predominate in foraging, a situ- 
ation in which animals are typically unaware of where next 
they will find food. This is very similar to what obtains 
in our case: participants are unaware of the locations of 
their next targets, and accordingly, the Hurst exponent H 
takes on values that are much greater than 0.5 (see Table 
[2]). Our tentative suggestion is therefore that saccadic eye 
movements obey fractional Levy dynamics, irrespective of 
the literacy level of the subject. 

In conclusion, our most striking observation here is that 
there is a clear separation of behavior in visual process- 
ing, via the distributions of saccadic and fixation times; 
the former shows clear scale-invariance, while the latter 
does not. The effect of literacy shows up only in the fixa- 
tion dynamics, where literate subjects typically show the 
presence of an attentional timescale around 220 msecs on 
average. However, the saccades of both literate and il- 
literate subjects appear to be characterised by fractional 
Levy- like dynamics; we suggest that this might be expli- 
cable by analogies with foraging dynamics in animals [35] , 
since in both cases the subjects are completely unaware of 
the locations of their next targets. 
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Fig. 5: Plot of the power spectrum P(f) for eight subjects 
against frequency /. The legend indicates the correspondence 
between the symbol and the type/serial number of the sub- 
ject. The exponents a are found from the slope (see text) and 
tabulated in Table [2] 
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